Acquiring Naturalistic Concept Descriptions from the Web
نویسندگان
چکیده
Many of the beliefs that one uses to reason about everyday entities and events are neither strictly true or even logically consistent. Rather, people appear to rely on a large body of folk knowledge in the form of stereotypical associations, clichés and other kinds of naturalistic descriptions, many of which express views of the world that are second-hand, overly-simplified and, in some cases, non-literal to the point of being poetic. These descriptions pervade our language yet one rarely finds them in authoritative linguistic resources like dictionaries and encyclopaedias. We describe here how such naturalistic descriptions can be harvested from the web in the guise of explicit similes and related text patterns, and empirically demonstrate that these descriptions do broadly capture the way people see the world, at least from the perspective of category organization in an ontology.
منابع مشابه
Health System Resilience: What Are We Talking About? A Scoping Review Mapping Characteristics and Keywords
Background Health systems are based on 6 functions that need to work together at all times to effectively deliver safe and quality health services. These functions are vulnerable to shocks and changes; if a health system is unable to withstand the pressure from a shock, it may cease to function or collapse. The concept of resilience has been introduced with the goal of strengthening healt...
متن کاملMultiple Convergence: An Approach to Disjunctive Concept Acquisition
Multiple convergence is proposed as a method for acquiring disjunctive concept descriptions. Disjunctive descriptions are necessary when the concept representation language is insufficiently expressive to satisfy the completeness and consistency requirements of inductive learning with a single conjunction of generalized features. Multiple convergence overcomes this insufficiency by allowing the...
متن کاملConcept Learning and Categorization from the Web
In previous work, we found that a great deal of information about noun attributes can be extracted from the Web using simple text patterns, and that enriching vector-based models of concepts with this information about attributes led to drastic improvements in noun categorization. We extend this previous work in two ways: (i) by comparing concept descriptions extracted using patterns with descr...
متن کاملExtracting concept descriptions from the Web: the importance of attributes and values
When extracting information about concepts from the Web, the problem is not recall, but precision: trying to identify which properties of a concept are genuinely distinctive. We discuss a series of experiments in empirical ontology using both unsupervised and supervised methods, showing that not all semantic relations we can extract from text are equally useful, and suggesting that attempting t...
متن کاملWeb-Based Semantic Pervasive Computing Services
Pervasive Computing refers to a seamless and invisible computing environment which provides dynamic, proactive and context-aware services to the user by acquiring context knowledge from the environment and composing available services. In this paper, we demonstrate how heterogeneous Web services can be made interoperable and used to support Pervasive Computing. We present an architecture how a ...
متن کامل